PromDrum — Exploiting the prosody-gesture link for intuitive, fast and fine-grained prominence annotation
نویسندگان
چکیده
Most prominence annotation methods have certain drawbacks. Simple binary scales may be too coarse to capture fine-grained prominence differences, and multi-level annotation schemes have been shown to be time-consuming and difficult to use for non-expert annotators. This study proposes a novel method for fine-grained and fast prominence annotation by exploiting the prosody-gesture link. On a sentence-by-sentence basis, native German participants were instructed to listen to audio recordings and reiterate them by beating on an electronic drum pad either once per syllable (experiment 1) or once per word (experiment 2), modulating the strength of each beat according to how strongly the syllable or word stood out in the sentence. The velocity profiles of MIDI outputs were then interpreted as correlates of perceived prominence and compared with fine-grained prominence ratings by three expert annotators. While wordlevel drumming showed high correlations to conventional ratings for some of the subjects, inexperienced participants often had considerable difficulty performing the task. Syllable-level drumming, on the other hand, proved to be a time-efficient and intuitive method for experienced and naive subjects alike. Especially by pooling velocity results from several participants to create mean values, it was possible to maintain high levels of correlation with expert prominence ratings.
منابع مشابه
Beat it! – Gesture-based Prominence Annotation as a Window to Individual Prosody Processing Strategies
In recent work [1], we have suggested a novel approach for fine-grained and fast prominence annotation by naı̈ve listeners. Our approach relies on annotators’ “drummed” replications of a perceived utterance, modulating their drumming velocity in accordance with the perceptual prominence of consecutive linguistic units (syllables, words). The drumming velocity is then used as a fine-grained opera...
متن کاملAutomatic prominence annotation of a German speech synthesis corpus: towards prominence-based prosody generation for unit selection synthesis
This paper describes work directed towards the development of a syllable prominence-based prosody generation functionality for a German unit selection speech synthesis system. A general concept for syllable prominence-based prosody generation in unit selection synthesis is proposed. As a first step towards its implementation, an automated syllable prominence annotation procedure based on acoust...
متن کاملTwo Dimensions of Prominence
Prosody fulfills a variety of functions in dialogues. Our study examines the relationship between different levels of perceived prominence of syllables and the linguistic and paralinguistic categories accent and emphasis which are conveyed prosodically. It is still unclear, how a notational system might look like that is able to capture the fine–grained differences between both. The notion of p...
متن کاملRelations between prominence and articulatory-prosodic cues in emotional speech
This study investigates the relations between the degree of prominence and articulatory-prosodic cues in emotional speech. In particular, this study considers articulatory parameters driven from the Converter/Distributor (C/D) model. The goal is to obtain a better understanding of the link among syllable magnitude in the C/D model, the empirical way to measure it in literature, and syllable-lev...
متن کاملA preliminary study of the temporal relationship between prosody and gesture in Hong Kong Cantonese
Previous studies of speech and gesture in intonational languages generally suggested that prosodic and gestural prominence are aligned with one another, pitch accented/ stressed syllable or the peak fundamental frequency (F0) of it being the prosodic anchor. A logical question to raise would be whether such alignment exists in tonal languages without lexical stress. To answer the question, this...
متن کامل